Nonparametric Regression ( and Classification ) Statistical Machine Learning , Spring 2017

نویسنده

  • Larry Wasserman
چکیده

• Note for i.i.d. samples (xi, yi) ∈ R × R, i = 1, . . . , n, we can always write yi = f0(xi) + i, i = 1, . . . , n, where i, i = 1, . . . , n are i.i.d. random errors, with mean zero. Therefore we can think about the sampling distribution as follows: (xi, i), i = 1, . . . , n are i.i.d. draws from some common joint distribution, where E( i) = 0, and yi, i = 1, . . . , n are generated from the above model • It is typical to assume that each i is independent of xi. This is a pretty strong assumption, and you should think about it skeptically. We too will make this assumption, for simplicity. It should be noted that a good portion of theoretical results that we cover (or at least, similar theory) also holds without this assumption

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Nonparametric Models

A Bayesian nonparametric model is a Bayesian model on an infinite-dimensional parameter space. The parameter space is typically chosen as the set of all possible solutions for a given learning problem. For example, in a regression problem the parameter space can be the set of continuous functions, and in a density estimation problem the space can consist of all densities. A Bayesian nonparametr...

متن کامل

Comparison of classic regression methods with neural network and support vector machine in classifying groundwater resources

In the present era, classification of data is one of the most important issues in various sciences in order to detect and predict events. In statistics, the traditional view of these classifications will be based on classic methods and statistical models such as logistic regression. In the present era, known as the era of explosion of information, in most cases, we are faced with data that c...

متن کامل

An empirical evaluation of easily implemented, nonparametric methods for generating synthetic datasets

When intense redaction is needed to protect data subjects’ confidentiality, statistical agencies can release synthetic data, in which identifying or sensitive values are replaced with draws from statistical models estimated from the confidential data. Specifying accurate synthesis models can be a difficult and labor intensive task with standard parametric approaches. We describe and empirically...

متن کامل

Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms

In the paper we presented the guidelines for the application of nonparametric statistical tests and post hoc procedures devised to perform multiple comparisons of machine learning algorithms. We emphasized it is necessary to distinguish between pairwise and multiple comparison tests. We showed that the pairwise Wilcoxon test, when employed to multiple comparisons, would lead to overoptimistic c...

متن کامل

Nonparametric Regression for Learning

In recent years, learning theory has been increasingly influenced by the fact that many learning algorithms have at least in part a comprehensive interpretation in terms of well established statistical theories. Furthermore, with little modification, several statistical methods can be directly cast into learning algorithms. One family of such methods stems from nonparametric regression. This pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017